FlowBoost - Appearance learning from sparsely annotated video
نویسندگان
چکیده
We propose a new learning method which exploits temporal consistency to successfully learn a complex appearance model from a sparsely labeled training video. Our approach consists in iteratively improving an appearancebased model built with a Boosting procedure, and the reconstruction of trajectories corresponding to the motion of multiple targets. We demonstrate the efficiency of our procedure on pedestrian detection in videos and cell detection in microscopy image sequences. In both cases, our method is demonstrated to reduce the labeling requirement by one to two orders of magnitude. We show that in some instances, our method trained with sparse labels on a video sequence is able to outperform a standard learning procedure trained with the fully labeled sequence.
منابع مشابه
Evaluation of Midwifery Student's Attitude, Performance and Satisfaction from teaching clinical skills with the Video in Hamedan School of Nursing and Midwifery (2019)
1. Duncan I, Yarwood-Ross L, Haigh C..YouTube as a source of clinical skills education. Nurse Eduction. .2013; 33 (12): 1576–1580 2. Arguel ., Jamet E. Using video and static pictures to improve learning of procedural contents.Comput. Hum. Behav.2008; 25 (2):354–359. 3. Johnson N, List-Ivankovic J, Eboh W, Ireland ., Adams D, Mowatt E, Martindale S. Research and evidence based pra...
متن کاملInstrument Tracking via Online Learning in Retinal Microsurgery
Robust visual tracking of instruments is an important task in retinal microsurgery. In this context, the instruments are subject to a large variety of appearance changes due to illumination and other changes during a procedure, which makes the task very challenging. Most existing methods require collecting a sufficient amount of labelled data and yet perform poorly in handling appearance change...
متن کاملSelf-Learning for Player Localization in Sports Video
This paper introduces a novel self-learning framework that automates the label acquisition process for improving models for detecting players in broadcast footage of sports games. Unlike most previous self-learning approaches for improving appearance-based object detectors from videos, we allow an unknown, unconstrained number of target objects in a more generalized video sequence with non-stat...
متن کاملVideo Object Segmentation using Tracked Object Proposals
We present an approach to semi-supervised video object segmentation, in the context of the DAVIS 2017 [8] challenge. Our approach combines category-based object detection, category-independent object appearance segmentation and temporal object tracking. We are motivated by the fact that the objects semantic category tends not to change throughout the video while its appearance and location can ...
متن کاملConnectionist Temporal Modeling for Weakly Supervised Action Labeling
We propose a weakly-supervised framework for action labeling in video, where only the order of occurring actions is required during training time. The key challenge is that the per-frame alignments between the input (video) and label (action) sequences are unknown during training. We address this by introducing the Extended Connectionist Temporal Classification (ECTC) framework to efficiently e...
متن کامل